Abstract: In this paper we propose a new algorithm for
learning polyhedral classifiers, which we call Polyceptron. It
is a Perceptron-like algorithm that updates the parameters
only when the current classifier misclassifies a training example.
We give both batch and online versions of the Polyceptron algorithm.
Finally, we give experimental results to show the effectiveness of
our approach.

I. Introduction 

Learning polyhedral classifiers is an interesting problem in
machine learning. The need to learn a polyhedral classifier
for a binary classification problem arises when the positive
examples are all concentrated in a single convex region, with
the negative examples lying all around that region. Even
when the positive examples are concentrated in a set that
is only approximately convex, learning a polyhedral classifier
can be a useful strategy. For a binary classification problem, a
given set of examples is polyhedrally separable if there is a
convex polyhedral set that contains all the positive examples and
none of the negative examples [1].

One can learn a classifier for polyhedrally separable data
using the support vector data description (SVDD) method [2],
which is a variant of the well-known Support Vector Machine
(SVM) method [3]. SVDD fits a minimum enclosing hypersphere
in the feature space so that most of the positive examples lie
inside the hypersphere [2], and all the negative examples are
treated as outliers. In such techniques, the nonlinearity in the
data is captured simply by choosing an appropriate kernel
function. Although SVM methods often give good classifiers,
with a nonlinear kernel function the final classifier may not
provide good geometric insight into the class boundaries in the
original feature space. Learning each of the hyperplanes that
make up the polyhedral set is often useful for understanding the
geometry of the class region and the local behavior of the
classifier in different regions of the feature space.

Decision trees are another approach that can be used to learn
polyhedral classifiers. When all positive examples belong to
a single polyhedral set, the ideal decision tree is one in which
every non-leaf node has a leaf (representing the negative class)
as one of its children and there is only one path leading to a
leaf for the positive class. Such a decision tree is called a
decision list, and it represents the polyhedral classifier
exactly.
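
The decision-list view can be made concrete with a minimal Python sketch. This is not taken from the paper: it assumes each internal node tests one halfspace of the form w^T x + b > 0 and sends a failing point to a negative leaf, and the names `decision_list_predict`, `Plane` and `square` are hypothetical.

```python
from typing import Sequence, Tuple

# Hypothetical representation: one (w, b) pair per internal node of the list.
Plane = Tuple[Sequence[float], float]


def decision_list_predict(x: Sequence[float], planes: Sequence[Plane]) -> int:
    """Walk the decision list: any violated halfspace leads to a negative leaf."""
    for w, b in planes:
        if sum(wi * xi for wi, xi in zip(w, x)) + b <= 0:
            return -1  # negative leaf hanging off this node
    return 1  # the single path that reaches the positive leaf


# Illustrative polyhedral set (an assumption): the open square |x1| < 1, |x2| < 1.
square = [((1, 0), 1), ((-1, 0), 1), ((0, 1), 1), ((0, -1), 1)]
inside = decision_list_predict((0.2, -0.3), square)
outside = decision_list_predict((1.5, 0.0), square)
```

Because the list returns the negative class as soon as one test fails, its output agrees with checking whether the point lies in the intersection of all the halfspaces.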



A polyhedral classifier can be learnt using a decision tree in
two ways. The first is to use one of the top-down greedy
methods followed in many decision tree algorithms
[4], [5]. In the top-down approaches, impurity-based
heuristics are used to learn optimal hyperplanes at each node.
But such general decision tree algorithms fail to learn a single
polyhedral set well: the learnt decision tree may be a
general tree and not a decision list.

In the second approach the tree structure is fixed and optimal 
parameters are learnt for the tree. 

For polyhedrally separable data, decision tree learning
can be reformulated as learning a decision list of fixed
structure. The structure is fixed by assuming that the number of
hyperplanes that make up the required polyhedral set is known
beforehand. One possible choice for learning such a decision
list is to formulate a constrained optimization problem [6], [7],
[8]. The objective there is to minimize the classification errors
subject to the separability conditions. It is worth noting that
these optimization problems are non-convex even though we
are learning a convex set. All the positive examples have
to satisfy each of a given set of linear inequalities. Thus the
constraint on each positive example is a logical 'and' of
linear constraints and hence forms a convex set. However,
each negative example fails to satisfy one (or more) of
these inequalities, and it is not known a priori which inequality
each negative example fails to satisfy. Thus the constraint on each
negative example is a logical 'or' of linear constraints
and hence forms a nonconvex set. For a negative example,
identifying which of the linear constraints in the logical 'or'
is violated is called the credit assignment problem in the
literature, and it makes learning polyhedral sets a difficult task

ID. 

In [6], this problem is solved by first enumerating all
possibilities for the misclassified negative examples (e.g., which
of the hyperplanes caused each negative example to be
misclassified; for each negative example there could be
many such hyperplanes) and then solving a linear program
for each possibility to find a descent direction. This approach
becomes computationally very expensive.

If, for every point falling outside the polyhedral set, it
is known beforehand which of the linear inequalities it will
satisfy, then the problem becomes much easier: it reduces to
solving K linear classification problems independently. But
this assumption is very unrealistic.



The approach in [8] relaxes this assumption a little: it assumes
that, for the sub-classification problem corresponding to each
hyperplane, a small subset of negative examples is known, and it
proposes a cyclic optimization algorithm (optimizing one classifier
out of K at a time). Still, the assumption of knowing a subset
of negative examples corresponding to each hyperplane is not
realistic in many practical applications.

Recently, a probabilistic discriminative model that uses the
logistic function to learn a polyhedral classifier has been
proposed in [9]. It is an unconstrained framework, and a simple
expectation maximization algorithm is used to learn the
parameters. However, the approach proposed in [9] is a batch
algorithm, and there is no incremental variant of it.

In this paper we propose a Perceptron-like algorithm for learning
polyhedral classifiers, which we call Polyceptron. We present
the Polyceptron criterion, which is minimized by the Polyceptron
algorithm.

The Polyceptron criterion is designed in the same way
as the Perceptron criterion. The Perceptron algorithm learns a
linear classifier by minimizing the Perceptron criterion, in which
only misclassified points contribute to the error term. In the
Polyceptron criterion also, only misclassified points contribute
to the error term; in other words, it assigns zero error
to correctly classified points. Since we have modified the
Perceptron algorithm to learn polyhedral classifiers, we call
it Polyceptron.

In this paper, we propose both batch and online versions of
the Polyceptron algorithm. Batch Polyceptron is an alternating
minimization algorithm for minimizing the Polyceptron
criterion. Online Polyceptron processes one example at a time
and updates the hyperplane parameters only when the current
example is misclassified.

The rest of the paper is organized as follows.
In Section II we discuss polyhedral separability and define
the polyhedral classifier. We describe the Polyceptron algorithm
in Section III. Experimental results are discussed in Section IV.
Finally, we conclude the paper with some discussion in Section V.

II. Polyhedral Classifier 

Let $D = \{(x_n, t_n) : x_n \in \mathbb{R}^d,\ t_n \in \{-1, 1\},\ n = 1 \ldots N\}$
be the training dataset. Let $A$ be the set of points
for which $t_n = 1$, and let $B$ be the set of points for which
$t_n = -1$. First we restate the polyhedral separability defined
in |LD, 0. 

A. Polyhedral Separability 

Two sets $A$ and $B$ in $\mathbb{R}^d$ are $K$-polyhedrally separable if there
exists a set of $K$ hyperplanes with parameters $(w_k, b_k)$,
$w_k \in \mathbb{R}^d$, $b_k \in \mathbb{R}$, $k = 1 \ldots K$, such that

1) $w_k^T x + b_k > 0$, $\forall x \in A$, $k = 1 \ldots K$

2) $w_k^T x + b_k < 0$, $\forall x \in B$, for at least one $k \in \{1, \ldots, K\}$

This means that two sets $A$ and $B$ are $K$-polyhedrally separable
if $A$ is contained in a convex polyhedral set formed by the
intersection of $K$ halfspaces and the points of set $B$ are
outside this polyhedral set. Figure 1 shows an example of two
sets $A$ and $B$ that are 3-polyhedrally separable.

Fig. 1. An example of polyhedrally separable sets A and B
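
The two separability conditions can be checked directly for a given set of hyperplanes. The following Python sketch is an illustration under stated assumptions: the strict inequalities are taken as written above, and the function names and the example square are hypothetical, not part of the paper.

```python
from typing import Sequence, Tuple

Plane = Tuple[Sequence[float], float]  # (w_k, b_k)


def affine(w: Sequence[float], b: float, x: Sequence[float]) -> float:
    """Evaluate w^T x + b."""
    return sum(wi * xi for wi, xi in zip(w, x)) + b


def is_polyhedrally_separated(A, B, planes: Sequence[Plane]) -> bool:
    """True if the given planes satisfy conditions 1) and 2) for sets A and B."""
    # Condition 1: every point of A is on the positive side of every hyperplane.
    cond_a = all(affine(w, b, x) > 0 for x in A for (w, b) in planes)
    # Condition 2: every point of B is on the negative side of at least one hyperplane.
    cond_b = all(any(affine(w, b, x) < 0 for (w, b) in planes) for x in B)
    return cond_a and cond_b


# Illustrative data (an assumption): the square |x1| < 1, |x2| < 1 built from K = 4 halfspaces.
square = [((1, 0), 1), ((-1, 0), 1), ((0, 1), 1), ((0, -1), 1)]
A = [(0.0, 0.0), (0.5, -0.5)]
B = [(2.0, 0.0), (0.0, -3.0)]
separated = is_polyhedrally_separated(A, B, square)
```

Note that this only verifies a candidate set of hyperplanes; finding such hyperplanes is the learning problem addressed in the rest of the paper.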

B. Polyhedral Classifier 

Let $\tilde{w}_k = [w_k^T\ b_k]^T \in \mathbb{R}^{d+1}$ and let
$\tilde{x}_n = [x_n^T\ 1]^T \in \mathbb{R}^{d+1}$.
We now express the earlier inequalities as $\tilde{w}_k^T \tilde{x} > 0$ and so
on. Using the definition of polyhedral separability discussed
earlier, let us define a function $h(x, \Theta)$ as below:

$$h(x, \Theta) = \min_{k \in \{1, \ldots, K\}} (w_k^T x + b_k)$$

where $\Theta = \{\tilde{w}_1, \ldots, \tilde{w}_K\}$ is the set of parameters of the
$K$ hyperplanes. Clearly, if $h(x, \Theta) > 0$, then the condition
$w_k^T x + b_k > 0$, $\forall k = 1 \ldots K$, is satisfied and the point $x$ is
assigned to set $A$. Similarly, if $h(x, \Theta) < 0$, there exists at
least one $k$ for which $w_k^T x + b_k < 0$ and the point $x$ is
assigned to set $B$. Let us assume that we know $K$ (the number of
hyperplanes forming the polyhedral set). Then the polyhedral
classifier is $f(x, \Theta) = \mathrm{sign}(h(x, \Theta))$.

Now the problem is to learn $\Theta$, the set of parameters of all
the hyperplanes, given the training data.
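
As a small illustration of the classifier defined above, the sketch below evaluates h and f in Python using augmented vectors. It is not the authors' implementation; the names `augment`, `h` and `classify` are hypothetical, and assigning the boundary case h = 0 to the negative class is an assumption, since the text only covers h > 0 and h < 0.

```python
from typing import List, Sequence


def augment(x: Sequence[float]) -> List[float]:
    """Append the constant 1 so that the bias b_k is the last weight component."""
    return list(x) + [1.0]


def h(x: Sequence[float], theta: Sequence[Sequence[float]]) -> float:
    """Minimum over the K hyperplanes of w_k^T x + b_k (theta holds augmented weights)."""
    xt = augment(x)
    return min(sum(wi * xi for wi, xi in zip(w, xt)) for w in theta)


def classify(x: Sequence[float], theta: Sequence[Sequence[float]]) -> int:
    """Polyhedral classifier sign(h); h == 0 is mapped to -1 here by assumption."""
    return 1 if h(x, theta) > 0 else -1


# Illustrative parameters (an assumption): the square |x1| < 1, |x2| < 1 with K = 4.
theta = [[1, 0, 1], [-1, 0, 1], [0, 1, 1], [0, -1, 1]]
label_inside = classify((0.0, 0.0), theta)
label_outside = classify((2.0, 0.0), theta)
```

Taking the minimum means a single violated halfspace is enough to make h negative, which is exactly the 'and' over hyperplanes for the positive class.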

III. Polyceptron 

Here we propose a Perceptron-like algorithm for learning
polyhedral classifiers, which we call Polyceptron. The goal of
Polyceptron is to find the parameter set
$\Theta = \{\tilde{w}_1, \ldots, \tilde{w}_K\}$ of $K$ hyperplanes such that each
point $x_n \in A$ has
$h(x_n, \Theta) = \min_{k \in \{1, \ldots, K\}} (\tilde{w}_k^T \tilde{x}_n) > 0$,
whereas each point $x_n \in B$ has
$h(x_n, \Theta) = \min_{k \in \{1, \ldots, K\}} (\tilde{w}_k^T \tilde{x}_n) < 0$.
Since $t_n \in \{-1, 1\}$ is the class label of $x_n$, we want each
point $x_n$ to satisfy $t_n h(x_n, \Theta) > 0$.

The Polyceptron algorithm finds a polyhedral classifier by
minimizing the Polyceptron criterion, which is defined as follows.

$$E_P(\Theta) = -\sum_{n=1}^{N} t_n h(x_n, \Theta)\, I_{\{t_n h(x_n, \Theta) < 0\}}$$

where $I_{\{t_n h(x_n, \Theta) < 0\}}$ is an indicator function which takes
value 1 if $t_n h(x_n, \Theta) < 0$ and 0 otherwise. The Polyceptron
criterion assigns zero error to a correctly classified point. On
the other hand, if a point $x_n$ is misclassified, the Polyceptron
criterion tries to minimize the quantity $-t_n h(x_n, \Theta)$.
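
A minimal Python sketch of evaluating this criterion on a dataset follows. It assumes the criterion sums $-t_n h(x_n, \Theta)$ over misclassified points only, as described above; the function name and the one-dimensional example are hypothetical.

```python
from typing import Sequence


def polyceptron_criterion(X, t, theta: Sequence[Sequence[float]]) -> float:
    """Sum of -t_n * h(x_n, theta) over points with t_n * h(x_n, theta) < 0."""
    total = 0.0
    for x, tn in zip(X, t):
        xt = list(x) + [1.0]  # augmented input
        hx = min(sum(wi * xi for wi, xi in zip(w, xt)) for w in theta)
        margin = tn * hx
        if margin < 0:  # only misclassified points contribute
            total += -margin
    return total


# Illustrative 1-D example (an assumption): the interval -1 < x < 1 as two halfspaces.
theta = [[1.0, 1.0], [-1.0, 1.0]]  # x + 1 > 0 and -x + 1 > 0
X = [(0.0,), (2.0,), (3.0,), (0.5,)]
t = [1, 1, -1, -1]
error = polyceptron_criterion(X, t, theta)
```

In this example the points x = 0 (positive) and x = 3 (negative) are classified correctly and contribute nothing, while x = 2 (labelled positive) and x = 0.5 (labelled negative) contribute 1 and 0.5 respectively.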

A. Batch Polyceptron 

Batch Polyceptron minimizes the Polyceptron criterion
considering all the data points at a time to find the parameters
of the polyhedral classifier. Batch Polyceptron works in the
following way.

Given the parameters of $K$ hyperplanes, $\tilde{w}_1, \ldots, \tilde{w}_K$, define the sets
$S_k = \{x_n \mid \tilde{x}_n^T \tilde{w}_k \le \tilde{x}_n^T \tilde{w}_j,\ \forall j \ne k\}$,
where we break ties by putting $x_n$ in the set $S_k$ with the least $k$ if
$\tilde{x}_n^T \tilde{w}_k \le \tilde{x}_n^T \tilde{w}_j,\ \forall j \ne k$,
is satisfied by more than one $k \in \{1, \ldots, K\}$. The sets $S_k$ are
disjoint. We can now write $E_P(\Theta)$ as

$$E_P(\Theta) = -\sum_{k=1}^{K} \sum_{x_n \in S_k} t_n \tilde{x}_n^T \tilde{w}_k\, I_{\{t_n \tilde{x}_n^T \tilde{w}_k < 0\}} \qquad (1)$$

For a fixed $k$,
$-\sum_{x_n \in S_k} t_n \tilde{x}_n^T \tilde{w}_k\, I_{\{t_n \tilde{x}_n^T \tilde{w}_k < 0\}}$
is the same as the Perceptron criterion function, and we can find
$\tilde{w}_k$ to optimize it using the Perceptron algorithm. However, in
$E_P(\Theta)$ defined by (1), the sets $S_k$ themselves are functions of
the set of parameters $\Theta = \{\tilde{w}_1, \ldots, \tilde{w}_K\}$. Hence we cannot
directly minimize $E_P(\Theta)$ given by (1) using standard gradient
descent.
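
The partition into the sets S_k, including the tie-breaking rule, can be sketched in a few lines of Python. The function name `assign_sets` is hypothetical, and the sketch returns example indices rather than the points themselves.

```python
from typing import List, Sequence


def assign_sets(X, theta: Sequence[Sequence[float]]) -> List[List[int]]:
    """Return S_1..S_K as lists of example indices.

    Each example goes to the hyperplane with the smallest augmented score;
    list.index returns the first minimum, so ties go to the least k.
    """
    sets: List[List[int]] = [[] for _ in theta]
    for n, x in enumerate(X):
        xt = list(x) + [1.0]
        scores = [sum(wi * xi for wi, xi in zip(w, xt)) for w in theta]
        k = scores.index(min(scores))
        sets[k].append(n)
    return sets


# Illustrative 1-D example (an assumption): hyperplanes x + 1 > 0 and -x + 1 > 0.
theta = [[1.0, 1.0], [-1.0, 1.0]]
X = [(-2.0,), (0.0,), (2.0,)]
S = assign_sets(X, theta)
```

Here x = 0 scores 1 under both hyperplanes, so the tie-breaking rule places it in the first set.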

To minimize the Polyceptron criterion we adopt an alternating
minimization scheme as follows. Let the parameter set after the
$c$-th iteration be $\Theta^c$. Keeping $\Theta^c$ fixed, we
calculate the sets
$S_k^c = \{x_n \mid \tilde{x}_n^T \tilde{w}_k^c \le \tilde{x}_n^T \tilde{w}_j^c,\ \forall j \ne k\}$. Now
we keep these sets $S_k^c$ fixed. Thus the Polyceptron criterion
after the $c$-th iteration becomes

$$E_P^c(\Theta) = \sum_{k=1}^{K} f_k^c(\tilde{w}_k)$$

where $f_k^c(\tilde{w}_k) = -\sum_{x_n \in S_k^c} t_n \tilde{w}_k^T \tilde{x}_n\, I_{\{t_n \tilde{x}_n^T \tilde{w}_k < 0\}}$.
The superscript $c$ is used to emphasize that the Polyceptron criterion
is evaluated by fixing the sets $S_k^c$, $k = 1 \ldots K$. Thus $E_P^c(\Theta)$
becomes a sum of $K$ functions $f_k^c(\tilde{w}_k)$ such that
$f_k^c(\tilde{w}_k)$ depends only on $\tilde{w}_k$ and does not vary with the
other $\tilde{w}_j$, $j \ne k$.

Now minimizing $E_P^c(\Theta)$ with respect to $\Theta$ boils down to
minimizing each $f_k^c(\tilde{w}_k)$ with respect to $\tilde{w}_k$. For every
$k \in \{1, \ldots, K\}$, a new weight vector $\tilde{w}_k^{c+1}$ is found using
the gradient descent update

$$\tilde{w}_k^{c+1} = \tilde{w}_k^c + \eta(c) \sum_{x_n \in S_k^c} t_n \tilde{x}_n\, I_{\{t_n \tilde{x}_n^T \tilde{w}_k^c < 0\}}$$

where $\eta(.)$ is the step size. Here we have given only one
iteration of gradient descent. We may not minimize $f_k^c(\tilde{w}_k)$
exactly, so we run a few steps of gradient descent to make
sure that the newly found weight vector $\tilde{w}_k^{c+1}$ is such that
$f_k^c(\tilde{w}_k^{c+1}) < f_k^c(\tilde{w}_k^c)$. Then we calculate $S_k^{c+1}$, and so on.



To summarize, batch Polyceptron is an alternating
minimization algorithm for minimizing the Polyceptron criterion. The
algorithm first finds the sets $S_k^c$, $k = 1 \ldots K$, for iteration $c$,
and then for each $k \in \{1, \ldots, K\}$ it learns a linear classifier
by minimizing $f_k^c(\tilde{w}_k)$. We keep repeating these two steps
until there is no significant change in the weight vectors. The
batch Polyceptron algorithm is given below as pseudocode.



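Algorithm 1: Batch Polyceptron
Input: Training dataset $\{(x_1, t_1), \ldots, (x_N, t_N)\}$, $K$
Output: $\{\tilde{w}_1, \ldots, \tilde{w}_K\}$
begin
    Initialize $\tilde{w}_1^0, \ldots, \tilde{w}_K^0$, step size $\eta(.)$, stopping criterion $\gamma$, $c \leftarrow 0$;
    while $\sum_{k=1}^{K} \eta(c) \| \sum_{x_n \in S_k^c} t_n \tilde{x}_n I_{\{t_n \tilde{x}_n^T \tilde{w}_k^c < 0\}} \| > \gamma$ do
        $c \leftarrow c + 1$;
        for $k \leftarrow 1$ to $K$ do
            $\tilde{w}_k^c \leftarrow \tilde{w}_k^{c-1} + \eta(c) \sum_{x_n \in S_k^{c-1}} t_n \tilde{x}_n I_{\{t_n \tilde{x}_n^T \tilde{w}_k^{c-1} < 0\}}$;
        end
    end
    return $\tilde{w}_1, \ldots, \tilde{w}_K$;
end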
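
The batch procedure can be sketched in plain Python as follows. This is a hedged illustration rather than the authors' MATLAB code: it takes a single gradient step per reassignment of the sets, uses a constant step size `eta`, stops when the total update norm falls to `gamma` or below, and adds a `max_iter` safeguard that is an assumption of this sketch.

```python
import math
from typing import List, Sequence


def batch_polyceptron(X, t, theta0, eta=0.1, gamma=1e-6, max_iter=1000) -> List[List[float]]:
    """Alternate between assigning points to hyperplanes and one Perceptron-style step."""
    theta = [[float(v) for v in w] for w in theta0]
    Xt = [list(x) + [1.0] for x in X]  # augmented inputs
    for _ in range(max_iter):
        # Accumulate, per hyperplane k, the sum of t_n * x_n over misclassified points in S_k.
        grads = [[0.0] * len(w) for w in theta]
        for xt, tn in zip(Xt, t):
            scores = [sum(wi * xi for wi, xi in zip(w, xt)) for w in theta]
            k = scores.index(min(scores))  # set membership, least k on ties
            if tn * scores[k] < 0:
                grads[k] = [g + tn * xi for g, xi in zip(grads[k], xt)]
        # Stopping rule: total size of the pending update compared with gamma.
        step = sum(eta * math.sqrt(sum(g * g for g in gk)) for gk in grads)
        if step <= gamma:
            break
        theta = [[wi + eta * gi for wi, gi in zip(w, gk)] for w, gk in zip(theta, grads)]
    return theta


# Illustrative call (an assumption): one positive point outside the initial interval (-1, 1).
theta_after_one_step = batch_polyceptron([(2.0,)], [1], [[1, 1], [-1, 1]], eta=0.5, max_iter=1)
```

In the illustrative call, the point x = 2 is assigned to the second hyperplane, which misclassifies it, so only that hyperplane's weights move.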



B. Online Polyceptron 

In the online Polyceptron algorithm, the examples are
presented in a sequence, and at every iteration the algorithm
updates the weight vectors based on the single example presented
at that iteration. The online algorithm works in
the following way. The example $x_c$ at the $c$-th iteration is checked
to see whether it is classified correctly using the present set
of parameters $\Theta^c$. If the example is correctly classified, then
all the weight vectors are kept unchanged. But if $x_c$ is
misclassified and $r = \arg\min_{k \in \{1, \ldots, K\}} (\tilde{w}_k^c)^T \tilde{x}_c$, then only
$\tilde{w}_r$ is updated, in the following way:

$$\tilde{w}_r^{c+1} = \tilde{w}_r^c + t_c \tilde{x}_c \qquad (2)$$

Here we have set the step size to 1. In general, any other
appropriate step size can also be chosen. The complete online
Polyceptron algorithm is described in Algorithm 2.

In the online Polyceptron algorithm, we see that the
contribution of $x_c$ to the error is reduced by the update, because

$$-t_c (\tilde{w}_r^{c+1})^T \tilde{x}_c = -t_c (\tilde{w}_r^c)^T \tilde{x}_c - t_c^2\, \tilde{x}_c^T \tilde{x}_c < -t_c (\tilde{w}_r^c)^T \tilde{x}_c$$

In the above we have used the fact that $\tilde{x}_c^T \tilde{x}_c > 0$ (and $t_c^2 = 1$).
Although the contribution of $x_c$ to the error is reduced, this
does not mean that the error contributions of other misclassified
examples are also reduced.

It is easy to see that the online algorithm is very similar to
the standard Perceptron algorithm. The original Perceptron
algorithm is known to converge in a finite number of iterations if
the training set is linearly separable. However, this does not imply
that the Polyceptron algorithm would converge if the data is
polyhedrally separable. The reason is as follows: when
$x_c$ is misclassified, we use it to update the weight vector
$\tilde{w}_r$, where $r$ is chosen as $r = \arg\min_k \tilde{x}_c^T \tilde{w}_k^c$. While this may
be a good heuristic for deciding which hyperplane should take
care of $x_c$, we have no way of knowing that it is the right one.
This is the same credit assignment problem that we explained
earlier. At present we have no proof of convergence of the online
Polyceptron algorithm. However, given the empirical results
presented in the next section, we feel that this algorithm should
have some interesting convergence properties.

Algorithm 2: Online Polyceptron
Input: A sequence of examples $(x_1, y_1), \ldots, (x_N, y_N)$, $K$
Output: $\tilde{w}_1, \ldots, \tilde{w}_K$
begin
    Initialize: $\tilde{w}_1^0, \ldots, \tilde{w}_K^0$, $p = 0$;
    while $p <$ num-passes do
        $p \leftarrow p + 1$;
        for $c \leftarrow 1$ to $N$ do
            Get a new example $(x_c, y_c)$;
            Predict $\hat{y}_c = \mathrm{sign}(h(x_c))$;
            Let $r = \arg\min_{k \in \{1, \ldots, K\}} (\tilde{w}_k^{c-1})^T \tilde{x}_c$;
            if $\hat{y}_c \ne y_c$ then
                $\tilde{w}_r^c \leftarrow \tilde{w}_r^{c-1} + y_c \tilde{x}_c$;
                $\tilde{w}_k^c \leftarrow \tilde{w}_k^{c-1}$, $\forall k \ne r$;
            else
                $\tilde{w}_k^c \leftarrow \tilde{w}_k^{c-1}$, $k = 1, \ldots, K$;
            end
        end
        $\tilde{w}_k^0 \leftarrow \tilde{w}_k^N$, $k = 1, \ldots, K$;
    end
    return $\tilde{w}_1, \ldots, \tilde{w}_K$;
end
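
The online procedure can be sketched in plain Python as below. This is an illustration under stated assumptions, not the authors' implementation: the step size is fixed at 1, the boundary case h = 0 is predicted as the negative class, and the function name `online_polyceptron` is hypothetical.

```python
from typing import List, Sequence


def online_polyceptron(X, y, theta0, num_passes=10) -> List[List[float]]:
    """Process one example at a time; update only the argmin hyperplane on a mistake."""
    theta = [list(w) for w in theta0]
    for _ in range(num_passes):
        for x, yc in zip(X, y):
            xt = list(x) + [1.0]  # augmented input
            scores = [sum(wi * xi for wi, xi in zip(w, xt)) for w in theta]
            r = scores.index(min(scores))       # hyperplane that determines h(x)
            y_hat = 1 if scores[r] > 0 else -1  # sign(h(x)); h == 0 treated as -1 (assumption)
            if y_hat != yc:
                # Perceptron-style update with step size 1, applied to w_r only.
                theta[r] = [wi + yc * xi for wi, xi in zip(theta[r], xt)]
    return theta


# Illustrative call (an assumption): one positive point outside the initial interval (-1, 1).
theta_online = online_polyceptron([(2.0,)], [1], [[1, 1], [-1, 1]], num_passes=2)
```

In the illustrative call the first pass makes one update to the second hyperplane, after which the point is classified correctly and the second pass changes nothing.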

IV. Experiments 

To test the effectiveness of the Polyceptron algorithm, we evaluate
its performance on several synthetic and real-world datasets.
We compare our approach with three different types of algorithms:
OC1 [10], which is a generic top-down oblique decision tree
algorithm; the constrained optimization based approach for
learning polyhedral classifiers discussed in [6], which
successively solves linear programs and which we call PC-SLP
(Polyhedral Classifier-Successive Linear Program); and a
polyhedral learning algorithm called SPLA1 [9]. Since the
objective here is to explicitly learn the hyperplanes that define
the polyhedral set, we feel that comparisons with other general
pattern recognition techniques (e.g., SVM) are not relevant.

Dataset Description 

We generate two polyhedrally separable datasets in different
dimensions, which are described below.

1) Dataset 1: 10-dimensional polyhedral set. 1000 points
   are sampled uniformly from $[-1, 1]^{10}$. A polyhedral set
   is formed by the intersection of the following three halfspaces.

   a) $x_1 + x_2 + x_3 + x_4 + x_5 + x_6 + x_7 + x_8 + x_9 + x_{10} + 1 > 0$

   b) $x_1 - x_2 + x_3 - x_4 + x_5 - x_6 + x_7 - x_8 + x_9 - x_{10} + 1 > 0$

   c) $x_1 + x_3 + x_5 + x_7 + x_9 + 0.5 > 0$

   Points falling inside the polyhedral set are labeled as
   positive examples and the points falling outside this
   polyhedral set are labeled as negative examples. The
   numbers of positive and negative examples sampled are
   493 and 507 respectively.

2) Dataset 2: 20-dimensional polyhedral set. 1000 points
   are sampled uniformly from $[-1, 1]^{20}$. A polyhedral set
   is formed by the intersection of the following four halfspaces.

   a) $x_1 + 2x_2 + 3x_3 + 4x_4 + 5x_5 + 6x_6 + 7x_7 + 8x_8 + 8x_9 + 8x_{10} + 20x_{11} + 8x_{12} + 7x_{13} + 6x_{14} + 5x_{15} + 4x_{16} + 3x_{17} + 2x_{18} + x_{19} + x_{20} + 20 > 0$

   b) $-x_1 + 2x_2 - 3x_3 + 4x_4 - 5x_5 + 6x_6 - 7x_7 + 8x_8 - 9x_9 + 15x_{10} - 11x_{11} + 10x_{12} - 9x_{13} + 8x_{14} - 7x_{15} + 6x_{16} - 5x_{17} + 4x_{18} - 3x_{19} + 2x_{20} + 15 > 0$

   c) $x_1 + x_3 + x_5 + x_7 + 2x_8 + 8x_{10} + 2x_{12} + 3x_{13} + 3x_{15} + 3x_{16} + 4x_{18} + 4x_{20} + 8 > 0$

   d) $x_1 - x_2 + 2x_5 - 2x_6 + 6x_9 - 3x_{10} + 4x_{13} - 4x_{14} + 5x_{17} - 5x_{18} + 6 > 0$

   Points falling inside the polyhedral set are labeled as
   positive examples and the points falling outside this
   polyhedral set are labeled as negative examples. The
   numbers of positive and negative examples sampled are
   462 and 538 respectively.
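
A dataset of the first kind can be generated with a short Python sketch. It assumes the three halfspaces of Dataset 1 read as x1 + ... + x10 + 1 > 0, the alternating-sign sum plus 1 > 0, and x1 + x3 + x5 + x7 + x9 + 0.5 > 0; the function names and the seed are hypothetical, and the class counts it produces will generally differ from those reported in the paper because the random draws differ.

```python
import random
from typing import List, Sequence, Tuple


def label_dataset1(x: Sequence[float]) -> int:
    """Label +1 if x satisfies all three (assumed) halfspaces of Dataset 1, else -1."""
    a = sum(x) + 1
    b = sum(xi if i % 2 == 0 else -xi for i, xi in enumerate(x)) + 1
    c = x[0] + x[2] + x[4] + x[6] + x[8] + 0.5
    return 1 if min(a, b, c) > 0 else -1


def make_dataset1(n: int = 1000, seed: int = 0) -> Tuple[List[List[float]], List[int]]:
    """Sample n points uniformly from [-1, 1]^10 and label them."""
    rng = random.Random(seed)
    X = [[rng.uniform(-1.0, 1.0) for _ in range(10)] for _ in range(n)]
    t = [label_dataset1(x) for x in X]
    return X, t


X, t = make_dataset1(n=200, seed=1)
```

Using a seeded `random.Random` instance keeps the sample reproducible without touching the global random state.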

Apart from these two synthetic datasets, we also illustrate
the performance of our algorithm on a simple 2-dimensional
dataset where the polyhedral set is a square. Here the dataset
is obtained by sampling uniformly from $[-2, 2] \times [-2, 2]$
in $\mathbb{R}^2$, and the positive examples are those fully inside
$[-1.1, 1.1] \times [-1.1, 1.1]$. This dataset is used only to
illustrate how the algorithm learns; for this we show how
the polyhedral set being learnt evolves during the iterative
optimization procedure.

We also test Polyceptron on two real-world datasets
downloaded from the UCI ML repository [11], which are described in
Table I.

Data set         Dimension    # Points
Ionosphere       34           351
Breast-Cancer    10           683

TABLE I
Details of datasets used from UCI ML repository



Experimental Setup 

We implemented Polyceptron in MATLAB. In batch
Polyceptron there are two user-defined parameters, namely
the step size $\eta(.)$ and the stopping criterion $\gamma$. For our
experiments, we use a constant step size $\eta = 0.1$ for all iterations
and for all datasets. The $\gamma$ value used for each dataset is
specified in Table II. In online Polyceptron, we have one
user-defined parameter, num-passes, which is an upper bound on
the number of passes over the data. Different values of
num-passes are used for different datasets and are given in
Table II. We implemented SPLA1 in MATLAB. In SPLA1,
the number of hyperplanes is fixed beforehand. We used fixed
step size gradient ascent (GA) for the maximization steps in
SPLA1. The step size $\alpha$ for SPLA1 is a user-defined parameter.
For OC1 we used the downloadable package available
on the Internet [12]. We also implemented the PC-SLP approach
in MATLAB. All the user-defined parameters for the different
algorithms are chosen using ten-fold cross validation results.
All the simulations were done on a PC (Core2duo, 2.3GHz,
2GB RAM).

Analysis of Experiments 

We now discuss the performance of Polyceptron in comparison
with the other approaches on the different datasets. The results
provided are based on 10 repetitions of 10-fold cross validation.
We show average values and standard deviations (computed
over the 10 repetitions) of accuracy, time taken and the number
of hyperplanes learnt. Note that in Polyceptron, we fix the
number of hyperplanes beforehand. The results are presented
in Table II. We show results of both batch and online
Polyceptron. Table II also shows the results obtained with SPLA1,
OC1 and PC-SLP for comparison.
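
The evaluation protocol of repeated k-fold cross validation can be sketched as an index generator in Python. The paper does not say how the folds were drawn, so the reshuffling before each repetition, the strided fold assignment, and the name `repeated_kfold` are all assumptions of this sketch.

```python
import random
from typing import Iterator, List, Tuple


def repeated_kfold(n: int, folds: int = 10, repeats: int = 10,
                   seed: int = 0) -> Iterator[Tuple[int, int, List[int], List[int]]]:
    """Yield (repetition, fold, train_indices, test_indices) for repeated k-fold CV."""
    rng = random.Random(seed)
    for rep in range(repeats):
        idx = list(range(n))
        rng.shuffle(idx)  # a fresh random partition for every repetition
        for f in range(folds):
            test = idx[f::folds]  # every folds-th index, starting at offset f
            held_out = set(test)
            train = [i for i in idx if i not in held_out]
            yield rep, f, train, test


splits = list(repeated_kfold(25, folds=5, repeats=2))
```

Averaging a metric over the folds within each repetition and then reporting the mean and standard deviation across repetitions would match the description above.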

We see that batch Polyceptron is always better than online
Polyceptron in terms of both time and accuracy. In online
Polyceptron, at any iteration only one weight vector is changed,
and only when the current example is misclassified. An
individual update may therefore not improve the Polyceptron
criterion. Because of this, online Polyceptron takes more time to
find appropriate weight vectors to form the polyhedral classifier.
On the other hand, batch Polyceptron minimizes the Polyceptron
criterion using an alternating minimization scheme, and we observe
experimentally that the Polyceptron criterion decreases
monotonically after every iteration of batch Polyceptron.

We see that batch Polyceptron is more accurate than SPLA1 on
every dataset except Ionosphere, by about 3.5 percentage points
on the two synthetic datasets and about 9 points on Breast Cancer
(Table II). On the Ionosphere dataset the accuracy of batch
Polyceptron is lower than that of SPLA1 by a very small amount
(89.06 versus 89.65). Time-wise, SPLA1 is up to about 2.5 times
faster than batch Polyceptron except on the Ionosphere dataset,
where batch Polyceptron runs faster than SPLA1.
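
The comparison with SPLA1 can be checked with simple arithmetic on the mean values reported in Table II. The short Python sketch below only restates those table entries; the variable names are hypothetical.

```python
# Mean values copied from Table II as (batch Polyceptron, SPLA1-GA).
time_sec = {
    "Dataset1": (0.12, 0.06),
    "Dataset2": (0.23, 0.09),
    "Ionosphere": (0.02, 0.06),
    "Breast Cancer": (0.12, 0.05),
}
accuracy = {
    "Dataset1": (94.34, 90.80),
    "Dataset2": (94.56, 90.89),
    "Ionosphere": (89.06, 89.65),
    "Breast Cancer": (98.49, 89.50),
}

# How many times slower batch Polyceptron is than SPLA1 (values below 1 mean it is faster).
slowdown = {name: batch / spla for name, (batch, spla) in time_sec.items()}
# Accuracy difference in percentage points (positive means batch Polyceptron is better).
acc_gap = {name: batch - spla for name, (batch, spla) in accuracy.items()}
```

The largest slowdown is on Dataset 2 (about 2.56), and Ionosphere is the only dataset where the slowdown is below 1 and the accuracy gap is negative.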

The results obtained with OC1 show that a generic decision
tree algorithm is not good at learning polyhedral classifiers.
Polyceptron learns the required polyhedral classifier with
fewer hyperplanes than OC1. This happens because ours is a
model-based approach specially designed for polyhedral
classifiers, whereas OC1 is a greedy approach for learning
general piecewise linear classifiers. For the synthetic
datasets, we see that the accuracies of both batch and online
Polyceptron are greater than that of OC1 by a large margin.
As the dimension increases, the search problem for OC1
explodes combinatorially. As a consequence, the performance of
OC1 decreases as the dimension is increased, which is apparent
from the results shown in Table II. Also OC1, being
a general decision tree algorithm, gives a tree with a large
number of hyperplanes.

For the real-world datasets, batch Polyceptron always outperforms
OC1. We see that for the Breast Cancer and Ionosphere
datasets, the polyhedral classifiers learnt using batch Polyceptron
give very high accuracy. This suggests that both these
datasets are nearly polyhedrally separable. In general, batch
Polyceptron is much faster than OC1.

Compared to PC-SLP [6], the Polyceptron approach always
performs better in terms of both time and accuracy. As
discussed in Section I, PC-SLP, which is a nonconvex constrained
optimization based approach, has to deal with the credit
assignment problem combinatorially, which degrades its performance
both computationally and qualitatively. Polyceptron does not
suffer from this problem.

Thus, in summary, Polyceptron, in general, outperforms 
a generic decision tree method as well as any specialized 
algorithm for learning polyhedral sets (e.g., PC-SLP). 

V. Conclusions 

In this paper, we have proposed a new approach for learning
polyhedral classifiers, which we call Polyceptron. We
propose the Polyceptron criterion, whose minimizer gives the
polyhedral classifier. To minimize the Polyceptron criterion, we
propose online and batch versions of the Polyceptron algorithm.
Batch Polyceptron minimizes the Polyceptron criterion using
an alternating minimization algorithm. Online Polyceptron
works like the Perceptron algorithm in that it updates the
weight vectors only when there is a misclassification. Both
algorithms are very simple to understand
and implement. We show experimentally that our approach
efficiently finds polyhedral classifiers when the data is actually
polyhedrally separable. For real-world datasets also, our
approach performs better than any general decision tree method
or specialized method for polyhedral sets.
Data set        Method                                 Accuracy (%)   Time (sec)      # hyperplanes
Dataset1        Batch Polyceptron ($\gamma$ = 50)      94.34±1.11     0.12±0.001      3
                Online Polyceptron (num-passes=300)    89.08±0.91     1.55±0.01       3
                SPLA1-GA ($\alpha$ = 0.9)              90.80±4.93     0.06            3
                OC1                                    77.53±1.74     6.65±0.87       22.01±5.52
                PC-SLP                                 71.26±5.46     27.70±10.07     3
Dataset2        Batch Polyceptron ($\gamma$ = 50)      94.56±0.90     0.23±0.53       4
                Online Polyceptron (num-passes=400)    94.34±1.96     1.76±0.16       4
                SPLA1-GA ($\alpha$ = 1.2)              90.89±2.99     0.09±0.01       4
                OC1                                    63.64±1.48     10.01±0.67      27.36±6.98
                PC-SLP                                 56.42±0.79     189.91±19.73    4
Ionosphere      Batch Polyceptron ($\gamma$ = 20)      89.06±2.22     0.02±0.002      2
                Online Polyceptron (num-passes=500)    81.15±2.66     0.91±0.03       2
                SPLA1-GA ($\alpha$ = 0.3)              89.65±1.98     0.06±0.004      2
                OC1                                    86.49±2.08     2.4±0.11        8.99±3.36
                PC-SLP                                 78.77±3.96     45.31±35.66     2
Breast Cancer   Batch Polyceptron ($\gamma$ = 20)      98.49±0.21     0.12±0.03       2
                Online Polyceptron (num-passes=500)    91.93±3.36     1.72±0.01       2
                SPLA1-GA                               89.50±5.17     0.05±0.01       2
                OC1                                    94.89±0.81     1.52±0.13       5.82±0.95
                PC-SLP                                 83.87±1.42     22.86±1.07      2

TABLE II
Comparison Results